Stencil kernels dominate a range of scientific applications, including seismic and medical imaging, image processing, and neural networks. Temporal blocking is a performance optimization that aims to reduce the required memory bandwidth of stencil computations by re-using data from the cache for multiple time steps. It has already been shown to be beneficial for this class of algorithms. However, applying temporal blocking to practical applications' stencils remains challenging. These computations often consist of sparsely located operators not aligned with the computational grid (“off-the-grid”). Our work is motivated by modelling problems in which source injections result in wavefields that must then be measured at receivers by interpolat...
This doctoral project is about the solution of inverse problems on hyperbolic PDEs. It includes work...
We introduce Devito, a new domain-specific language for implementing high-performance finite-differe...
Simulations based on stencil computations (widely used in geosciences) have been dominated by the MP...
Stencil computations are a key part of many high-performance computing applications, such as image p...
The growth of data to be processed in the Oil & Gas industry matches the requirements imposed by evo...
AbstractTemporal blocking is a class of algorithms which reduces the required memory bandwidth (B/F ...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
AbstractIn this paper, we use our stencil code generation and auto-tuning framework Patus to optimiz...
New algorithms and optimization techniques are needed to balance the accelerating trend towards band...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
pre-printStencil computations are a common class of operations that appear in many computational sci...
Wave-based analog signal processing holds the promise of extremely fast, on-the-fly, power-efficient...
International audienceStencil computations are the basis to solve many problems related to Partial D...
Publication rights licensed to ACM. Sparse tiling is a technique to fuse loops that access common da...
Current representation learning methods in Spiking Neural Networks (SNNs) rely on rate-based encodin...
This doctoral project is about the solution of inverse problems on hyperbolic PDEs. It includes work...
We introduce Devito, a new domain-specific language for implementing high-performance finite-differe...
Simulations based on stencil computations (widely used in geosciences) have been dominated by the MP...
Stencil computations are a key part of many high-performance computing applications, such as image p...
The growth of data to be processed in the Oil & Gas industry matches the requirements imposed by evo...
AbstractTemporal blocking is a class of algorithms which reduces the required memory bandwidth (B/F ...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
AbstractIn this paper, we use our stencil code generation and auto-tuning framework Patus to optimiz...
New algorithms and optimization techniques are needed to balance the accelerating trend towards band...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
pre-printStencil computations are a common class of operations that appear in many computational sci...
Wave-based analog signal processing holds the promise of extremely fast, on-the-fly, power-efficient...
International audienceStencil computations are the basis to solve many problems related to Partial D...
Publication rights licensed to ACM. Sparse tiling is a technique to fuse loops that access common da...
Current representation learning methods in Spiking Neural Networks (SNNs) rely on rate-based encodin...
This doctoral project is about the solution of inverse problems on hyperbolic PDEs. It includes work...
We introduce Devito, a new domain-specific language for implementing high-performance finite-differe...
Simulations based on stencil computations (widely used in geosciences) have been dominated by the MP...